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ABSTRACT 

We introduce the contour process to describe the geometrical properties of merger 
trees. The contour process produces a one-dimensional object, the contour walk, which 
is a translation of the merger tree. We portray the contour walk through its length 
and action. The length is proportional to to the number of progenitors in the tree, and 
the action can be interpreted as a proxy of the mean length of a branch in a merger 
tree. 

We obtain the contour walk for merger trees extracted from the public database of 
the Millennium Run and also for merger trees constructed with a public Monte-Carlo 
code which implements a Markovian algorithm. The trees correspond to halos of final 
masses between 10^^h~^ Mq and lQ^^h~^ M©. We study how the length and action of 
the walks evolve with the mass of the final halo. In all the cases, except for the action 
measured from Markovian trees, we find a transitional scale around 3 x 10^^h~^ M©. 
As a general trend the length and action measured from the Markovian trees show a 
large scatter in comparison with the case of the Millennium Run trees. 
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1 INTRODUCTION 

The current paradigm of large scale structure formation in 
the Universe is hierarchical, meaning that structures are 
formed by merger aggregation. 

The dominant driver for this merger dynamic is thought 
to be the non-baryonic and non-coUisional cold dark matter. 
Therefore, research in large scale structure formation is prin- 
cipally done through the study of dark matter structures, 
which on large scale s are thought to serv e as scaffolding for 
baryonic structures (jSpringel et al.|[200^ . 

The dark matter merger process has been usually rep- 
resented by a tree, keeping the analogy with the genealog- 
ical tree of an individual. Great effort has been put into 
the methods of construction of merger trees, as they are 
a way to understand dark matter aggregation and are a 
necessary input for codes of semi-analytic galaxy forma- 
tion. The methods ran ge from t he an alytical approach of 
Monte-Carlo methods (|Zentneij l|2007l ) for a review) pass- 
ing through hybrid approaches that mix numerical realiza- 
tions of a de nsity field with ana lytical approximations for 
its evolution (jM onaco et al.ll2007l ) to the fully numerical ap- 
proach that identifies the dark matter halos from different 
snapsh ots in a A^-body sim ulation to construct the merger 
historv (|Hatton et al.ll2003l ). 
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Usually, the validity of the trees constructed in the ana- 
lytical and hybrid way is stated from comp arisons with trees 
constructed from numerical simulations (ISheth fc LemsonI 
1 19991 : [Parkinson et al.ll2007l : iNeistein fc Dekelll200'iD . Unfor- 
tunately, the quantities used to compare trees from two ap- 
proaches usually sacrifice the complexity inherent to the tree 
structure in the sake of simple tests. The most common sim- 
plification is to select one branch of the tree (the most mas- 
sive) to make the analysis. Another approach, measures the 
abundance of structures of a given mass among the halos in 
all the merger tree. In all these cases the geometrical infor- 
mation is suppressed, mostly because of the lack of simple 
structures to describe that kind of information. 

In this paper we present a new way, in the astrophys- 
ical context, to translate the geometrical information from 
a merger tree into a 1-dimensional structure. The transla- 
tion is based on the encoding of the tree information into its 
contour walk. 

We apply this description to the merger trees from a 
large dark matter num erical simulation, the Millennium Run 
(|Springel et al.ll2005l ). We use its public database to select 
halos in different bin masses at redshift z — to extract 
its merger histories and build the contour walk. We analyze 
each tree in terms of two simple statistics extracted from 
these walks. 

We have also performed this kind of analysis on 
merger trees obtained with the algorithm described by 
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iNeistein fc Dekell l|200iD , using the source code they kindly 
made public. With this code we have constructed trees with 
two resolutions for the minimum mass halo, one mimicking 
the Millennium Run, and the other with nearly 8 times lower 
resolution. 

This paper is divided as follows. In Section 2 we explain 
the construction of a contour walk from a merger tree. In 
Section 3 we present the simulation that produced the public 
data we used in this paper and the Monte Carlo code for 
the construction of Markovian merger trees. In Section 4 
we perform an immediate implementation of these concepts 
to the available merger trees. We calculate global statistics 
from the walks, and discuss its possible physical meaning. In 
the last section we discuss our results and suggest how the 
tool we have proposed can be used to tackle more complex 
questions about merger trees. 



2 TREES AND WALKS 

The merger trees in the current dark matter paradigm are 
simple, as only merging is allowed. From the mathematical 
point of view this tree structure corresponds to a Galton- 
Watson tree. 

Galton- Watson trees are genealogical trees of a popu- 
lation that evolves according to very simple rules. At gen- 
eration 0, the population starts with one individual, called 
the ancestor. Then each individual in the population has, 
independently of the others, a random number of children 
according to an offspring probability distribution. In the 
structure formation context we identify the ancestor with 
the halo at redshift zero, and the offsprings with the par- 
ents of the halo. 

These trees can be coded by a discrete path around the 
contour of the tree (|Le Gallll2005D . This contour process is 
easy to visualize. We imagine the displacement of a particle 
that starts at time zero from the root of the tree (a halo at 
redshift zero) and then visit the first not yet visited parent 
halo, if any, and if none, the successor of the halo. We do 
so until all the members in the tree have been explored and 
the particle has come back to the origin. 

This process is illustrated in the Fig[l] Where the dif- 
ferent values of t correspond to the discrete snapshots for 
which we have the halo information, and Ti is the imagi- 
nary time giving the pace of the particle around the tree. 
We point out that one has to define some order between 
the halos at a given point in the tree in order to define a 
unique way to walk the tree, a criteria to decide to which 
parent should jump the particle. This ordering criterium in 
our case is naturally imposed by the masses of the halos, as 
we always visit first the most massive progenitor. 

We can express this as visiting all the halos in the tree in 
a depth-first fashion, visiting first the most massive branch 
at every time. 



3 N-BODY SIMULATION AND 
MONTE-CARLO CODE 

3.1 The Millennium Run 




The Millennium Run (jSpringel et al ] 120051 ) (MR hereaft^er) 
was carried out by the Virgo Consortium in 2004 at the Max 



Figure 1. Illustration of the contour process for a simple merger 
tree. The upper panel represents a merger tree formed by an 
evolving halo population at different timesteps t (horizontal 
lines), time flowing from top to bottom. We start from the 
last halo at the bottom, and visit the halos in the tree at unit 
timesteps r. The most massive progenitor is always visited first. 
We go around until we go back to the initial point. The bottom 
part of the figure shows the contour walk, its values correspond 
to the time t where the particle was located at time t. The tree 
structure can be recovered completely from the contour walk as 
indicated by the solid lines in the lower panel. 



Planck's Society's supercomputer center in Garching. It is 
a dark matter only simulation containing 2160^ particles, it 
was evolved from z = 127 to the present-day inside a cubic 
region of 500 Mpc h~^ on a side, the individual particle mass 
is 8.6 X 10* /i"^ Mq. It adopted ildm = 0.205, fit = 0.045 
for the current densities of dark matter and baryons in a 
Lambda Cold Dark Matter cosmological model, furthermore 
it used h = 0.73 for the present dimensionless value of the 
Hubble constant, erg = 0.9 for the rms linear mass fiuctua- 
tion in a sphere of radius 8/i~^Mpc extrapolated to 2 = 0, 
and n = 1 for the slope of the primordial fiuctuation spec- 
trum. 

The simulati on was run us ing the TREE-PM N-body 
code GADGET2 (|Springelll2o"05l ), 64 outputs were stored at 
times spaced approximately equally in the logarithm of the 
expansion factor at early times and at approximately 200 
Myr intervals after z — 1. At run time all collapsed halos 
with at least 20 particles (1.7 x 10^°/t~ ^ Mq) were iden - 
tified using a friends-of-friends (FOF) (|Davis et all Il985l ) 
group-finder with linking length b = 0.2. P ost processing 
with the substructure algorithm SUBFIND (jSpringel et al.l 





Figure 2. Examples of contour walks constructed from merger trees extracted from the Millennium Run. Each contour corresponds to 
a halo of different mass. ~ lO^'^h'^ Mq (upper left), ~ lO^^h'^ Mq (upper right), ~ lO^^/j-i (lower left), ~ lO^^h-^ Mq (lower 
right). 



I2OOII ) allowed a detection and measurement of the resolved 
sub-halos. This in turn allowed trees to be built with detailed 
assembly histories for every object and its substructure. 

The postprocessed data have been publicly 
disseminated through an interactive database: 
|http : / /www ■ g-vo ■ org/Millennium/MyDB The partic- 
ular structure of the database design which allows 
efficient querying for mer ger trees was implemented by 
iLemson &: Springell l|200d ). Conveniently enough, the 
structure for merger trees in the database is based on 
a depth-first ordering, making that the output of a tree 
query is an incomplete version of the contour walk In 
terms of the lower panel in Fig[l] only the upward arrows 
offspring-progenitor exist. 



3.2 Markovian Trees 

The authors iNeistein fc Dekell (|2007l ) recently proposed and 
algorithm for the construction of merger trees based on a 
Markovian approach. This approach to merger trees means 
that any halo of a given mass M at time t has a progenitor 
probability distribution depending only on M and t. This 



scheme is explicitly independent of the large scale environ- 
ment that could be defined for a halo. 

Their merger trees are parameterized by a time variable 
uj = Sc/D{z) and a mass variable S{M) = a^{M), where 
5c ~ 1.69 and D{z) is the cosmological linear growth rate 
and (7^(M) is the variance of the initial density fluctuation 
field, linearly extrapolated to z = and smoothed using 
a window function that corresponds to a mass M. These 
two variables are also the n atural variable s in the Extended 
Press-Schechter formalism (|Zentnerll2007l ). 

Their approach is the following. First, from the MR data 
they find the conditional probabilities for the masses of the 
main progenitors at a past time t' as a function of halo mass 
M at a present time t. With this conditional probabilities 
they build the main progenitor history which by construc- 
tion reproduce to a good extent the MR data. Then, they 
extended this approach to the construction of a full merger 
tree. The extension included some additional heuristic rules 
based again on the premise of a fair match to the total pro- 
genitor number density dN/dM from the Millennium Run 
data. Nevertheless, the lack of a true physical motivation 
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cannot ensure the reconstruction of the correct full joint 
distribution of progenitors. 

Even if N-body merger trees are not Markovian, the 
proposed algorithm manages to reproduce some tree prop- 
erties, specially those related with the main progenitor but 
also the total mass distribution in all the progenitors. The 
detected inaccuracies come from the fact that the estimation 
of the average mass for the second progenitor is not reliable. 

In spite of that, we have decided to use these author's 
public available code given its explicit effort to reproduce 
the MR data. 



4 EXPERIMENT SETUP AND NOTATION 
4.1 Tree Selection 

We make use of two kind of trees. Trees extracted from 
the MR public data base and trees constructed with the 
public code implementing the Markovian approach. In both 
cases we concentrate on halos with masses greater than 
10"7i-1Mq. 

For the MR trees we selected all the halos in the sim- 
ulation box with a given mass Mh = logio(M2oo/10^°/i-i 
Mq) at redshift z — 0, for 28 different values of Mh in bins 
of width AAIh ~ 0.002 dex, where M200 is the halo mass 
measured within the radius where the halo has an overden- 
sity 200 times the critical density of the simulation. The bins 
are spaced by 0.1 dex, the least massive bin corresponds to 
Mh = 1.3 and has 6028 dark matter halos and the most 
massive bin corresponds to Mh = 4.0 and has 17 dark mat- 
ter halos. We also performed measurements for the 15 most 
massive bins with AMh = O.Oldex, which provided us with 
nearly five times more halos per bin. The results we ob- 
tained in that case are basically the same than in the case 
AMh ~ 0.002. In this paper we only report and discuss 
the results for the selection with the smaller AAIh = 0.002 
sample. 

For the Markovian trees we made two different runs 
changing the minimal halo mass. The first one mimicked the 
MR and its minimal mass was 1.7xl0^°h-^ Mq. The second 
run used a higher minimal mass of 1.0 X 10" /i"^ Mq. We will 
refer to these runs as the high resolution and low resolution 
runs, respectively. In both cases we also constructed merger 
trees for halo mass bins spaced by 0.1 dex. The mass bins for 
the high resolution run are the same as in the MR case, for 
the low resolution run we can only describe trees from the 
bin Mh = 1.6 up to Mh = 4.0. For each bin we construct 
1000 trees, each one with 100 steps in a; with Auj — 0.1. 
Which is equivalent to have trees described from redshifts 
z ^8. 



4.2 Notation 

The snapshots in the dark matter simulation will be labeled 
by ti, where i ranges from to 63. The snapshot to corre- 
sponds to the z — 0. We select a halo in the snapshot to 
to extract its merger tree. The contour walk, as described 
in Section [21 can be visualized as a dynamical process of a 
particle going around the tree, at unit time-steps stopping 
at each node in the tree and recording the snapshot ti to 



which it belongs. The discrete variable counting the imagi- 
nary time of the particle walking the tree is noted Ti . 

We write the walk as a sequence of discrete values 
{xo, . . - Xn}, corresponding to different values of the discrete 
intermediary times {tq, . . . , tjv}. A*" is the total length of the 
walk, and every Xi can take values from the possible snap- 
shots to . . . tes. In the case of the Markovian trees the values 
the walk can take vary from ljo to cugg, consistent with the 
fact that we have described these trees at 100 points equally 
spaced by Acu = 0.1. 

For each merger tree we compute two statistics: its 
length and action. The length corresponds to the number of 
points in the walk. The action, which will be defined later, 
is an statistic based on the first derivatives of the walk. We 
offer in Fig[2] a feeling on the merger walks for halos in four 
different mass bins. 



5 SIMPLE STATISTICS FROM THE WALKS 
5.1 Walk Length 

The Figure [3] shows the walk length for halo merger trees 
as a function of halo mass. At low masses where the growth 
should be dominated by mass accretion and not by merg- 
ers, A'^ oc M'''^ . At high masses where the growth starts to 
be controlled by mergers A'^ oc M"'*. The transitional scale 
corresponds to halos for a halo mass 1.6 X IO^^^-^Mq. 

This transitional scale should depend on the resolu- 
tion of the dark matter simulation. When smaller halos 
are resolved, the merger trees will be populated with more 
branches of this lower mass halos, and one could start to see 
that all the growth is done trough mergers. 

Nevertheless, if we intend to study galaxy formation us- 
ing merger trees, this suggested transitional scale may have 
some significance. In the favored paradigm of galaxy forma- 
tion, not every halo can harbor galaxies at a cosmic epoch. 
Only halos that can cool efficiently the gas may hold star 
forming galaxies. A lower limit is imposed by the UV back- 
ground from star formation, which sets a low baryonic frac- 
tion in halos of masses below ~ 1 x W^" h~^MQ, which is 
roughly the minimal resolution for halo detection in the Mil- 
lennium Run. If one aims to study galaxy formation, the UV 
background sets naturally a minimal mass of the progenitors 
that should be included in the merger history of a dark mat- 
ter halo. In any case, we decided to explore the influence of 
the resolution using the results from the Monte-Carlo code. 

In the case of merger trees constructed with a resolution 
mimicking the MR we find again a transitional scale (Fig|3| 
around 4.0 x lO^^/i^^M©. The biggest difference in compar- 
ison with the A''-body results are the exponents describing 
the length of the walk as a function of halo mass. In the 
Monte Carlo case both exponents are closer to A ~ 0.9. This 
results would favor a view from which there is still room left 
for a better description of smooth accretion into the Markov 
description. 

We extend the statistical description of the contour walk 
by the quantification of the its waviness. We borrow from 
statistical mechanics the concept of action, which is used in 
the numerical calculation of path integrals (|Krauthll2006l ). 

The action can be defined as the potential energy in- 
vested in bending the path, comparing it with a charac- 
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Figure 3. Upper panel Walk length Nf^aio for halo merger trees 
extracted from the Millennium Run as a function of the logarithm 
of the final halo mass. The solid lines show two different power law 
trends for the relation Ni^aio c< ^^halo' ^ ~ ^'^'^ ^ ~ ^^^^ "^^^ 
intersection of the two lines is located at 1.6 X 10^'^/i~^Mq. The 
dashed line simply passes through the two extreme mass points 
in the plot. Lower panel The same results as in the upper panel, 
but this time the measured values are normalized to the functional 
dependence of the dashed line in the upper panel. This enhances 
the features in the curve and allows an easier determination of 
the transitional mass-scale. 
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Figure 4. Upper panelSame as Fig[3]for merger trees constructed 
with the Markovian algorithm. Black symbols show the high res- 
olution run, and white symbols the low resolution run. Upper 
panel The lines show again two different power law trends for the 

relation N^alo oc ^halo^ ^ ~ ^-^^ '^"'^ ^ ~ ^-^^^ '^'^'^ more 
steeper than the A = {0.5,0.8} found in the Millennium merger 
trees. Lower panel The transitional mass-scale is roughly located 
at 4.0 X lO^'^/i~^M0 with a weak dependence on the run's reso- 
lution. The dispersion for the measurements from the Markovian 
trees is noticeably larger than the measurements coming from 
Millennium Run data. 



teristic temperature. If there is a walk defined by points 
{xi . . .xn}, the action is usuaUy defined as 



{Xj+i - Xj) 

2/3 



(1) 



where /3 plays the role of a temperature or imaginary 
time depending on the context. If we take that definition 
applied to our case: 



Ti + \ — Ti 



(2) 



as we have unit steps, r^+i — n is equal to [xi+i — s^l, 
making the action 5* equal to the walk length. Therefore, we 
decide to define a normalized action only from the extreme 
points in the contour walk. We take this extreme points, y^, 
as a sample of the contour walk where the derivative at point 
Xi, defined as (a;i+i — Xi_i)/2, is equal to zero. We note the 
set of times corresponding to the vanishing points of the 
derivative as £ (as in Extreme). 



If we now define the action on the sampled walk {y ; | Ti G 



1 {Vi+l-y^f 



■E 



(3) 



which makes the action S equal to the walk length di- 
vided by the number of extreme points. In the context of 
merger trees, each peak in the walk corresponds to a branch 
in the tree, making the contribution to the action roughly 
proportional to the length of that branch. Therefore, the 
normalized action could be loosely interpreted as a proxy 
for the mean length of one branch in the tree. 



5.2 Walk Action 

In Fig[5] we show the normalized action as the function of 
halo mass for the MR and the Monte-Carlo code. For MR 
trees we find a distinction between the behavior of the ac- 
tion for low and high mass halos. This time the two mass 
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Figure 5. Left panel Normalized wallc action S, Eq.l(3ll, as a function of halo mass for merger trees extracted from the Millennium Run. 
We find again a transitional mass-scale around 1.0 X lO^^/i~^M0, which could be interpreted as the mean branch length reaching a 
maximum for that halo mass. Right panel The same as the left panel for merger trees constructed with the Markovian algorithm. Black 
symbols refer to the high resolution run, white symbols to the low resolution run. The transitional mass-scale is replaced by a plateau. 
Here the mass resolution plays an important role shaping the general trend for the action S. As in the length case, the dispersion in the 
measurements from Markovian trees is larger than the measurements coming from Millennium Run data. 



extremes share the same action values, meaning that we can 
identify the same mean branch length for the two extremes 
masses. For intermediate masses the action increases. In the 
case of the MR trees, the action achieves a maximum for 
a mass scale 1.0 x 10^^h~^ M©. For Monte Carlo trees the 
maximum takes the form of a broader plateau ranging al- 
most two orders of magnitude between 4.0 x W^^h~^ Mq 
and 4.0 x lO^^/i"^ Mq. 

Perhaps the most distinguishing factor between the MR 
and Markovian approach is that the dispersion in the latter 
is much higher that in the former. A fact that can be inter- 
preted as a higher variability in the geometry of Markovian 
trees. 



6 DISCUSSION 

We introduced from the mathematical literature the contour 
process of a tree, and we applied this concept to the descrip- 
tion of merger trees. We used a large dark matter simulation 
(the Millennium Run) and a Monte-Carlo code (implement- 
ing a Markovian approach) to obtain merger trees in these 
two approximations. Furthermore, the Markovian trees were 
obtained with two different values for the minimal mass of a 
parent halo. One resolution mimicked exactly the MR, and 
the other resolution had a 8 times more massive minimal 
halo mass. We refer to these Markovian runs as the high 
resolution and low resolution runs respectively. 

We extracted simple statistics from these walk: the 
length (proportional to the total number of halos in the 
tree) and the action S (which can be loosely interpreted as 
the mean longitude of a branch). We report our results of 
walk length and action emphasizing its evolution as a func- 
tion of the physical halo mass, and not its absolute values. 

From the length and the action, we found in the Millen- 
nium Run a transitional mass scale at ~ 3.0 x lO^^/i"^ M©. 
In the case of the walk length, N, this transitional mass-scale 
marks the change between a dependence N oc M"'^ for low 
halo masses and N ocM"-^ for high mass halos, where M is 



the mass of the final dark matter halo in the tree. With the 
action, the scale marks the highest value for the action as 
function of the halo mass. 

For the Markovian trees the dependence of the walk 
length on the halo mass is almost the same everywhere, al- 
though much steeper than the dependence found in the Mil- 
lennium Run. Nonetheless, we found the same transitional 
scale from the length statistics for the two resolutions. It 
does not prove that the transitional scale is independent on 
the resolution mass (in fact, it should be dependent) but sug- 
gests that the scale does not have a strong dependence with 
the minimal mass resolution used to describe the merger 
tree. The evolution of the action as a function of halo mass 
is completely different from the MR case. The Markovian 
run with mimicking MR resolution does not show a sharp 
transitional scale, instead it shows a large plateau ranging 
for two orders of magnitude in mass. 

Perhaps the biggest difference between the two ways of 
constructing the trees is that the Markovian approach al- 
ways show a bigger dispersion on its contour walk statistics, 
which seem best defined in the Millennium Run, judging 
from its low dispersion. This could be interpreted in fact as 
a higher geometrical variability in the Markovian trees. 

We have shown how simple statistics from the contour 
walk can give a new handle on the description of merger 
tree, exploring the geometrical information of merger trees. 
Even if the length of the walk could have been obtained 
without an intermediating contour process, the approach of 
extracting information from higher order statistics, as was 
the case for the normalized action S, can be extended in 
complexity. For instance, is relevant to point out that the 
information of the mass in each node is also encoded in the 
contour walk. The mass information is used as an ordering 
criteria to walk the tree. One could define sections defined by 
{Ti\x{Ti = ti)}, i.e. the points where the walk touches the 
next to last snapshot, and the length ratio of the sections 
delimited by these points would include information of the 
mass ratio of the mergers at time ti . In general the study 
of crossing paths, defined as the the section of the contour 



The Coarse Geometry of Merger Trees in ACDM 7 



walks within some boundary a < {xi} < b might produce 
useful statistics to further classify the complex merger trees 
of massive halos. 
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